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In this paper we characterize all the r-parameter families of count distributions (satisfying mild 
conditions) that are closed under addition and under binomial subsampling. Surprisingly, few 
families satisfy both properties and the resulting models consist of the rth-order univariate 
Hermite distributions. Among these, we find the Poisson (r = 1) and the ordinary Hermite 
distributions (r — 2). 
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1. Introduction 

An important problem in data analysis is how to choose an adequate family of distribu- 
tions or statistical model to describe the values observed in any study. For this purpose 
the characterization theorems can be useful because, under general reasonable supposi- 
tions related to the nature of the experiment or the data, they allow us to reduce the 
possible set of distributions that can be used. One of these reasonable assumptions is that 
the model is closed under addition, that is, closed under convolutions. This means that 
the distribution of the sum of independent random variables with distributions belonging 
to the model also belongs to the same model. The property of closure under addition 
has widely been used to characterize families of distributions, the papers of Tcichcr [16] 
and Godambe and Patil [4] being particularly noteworthy. Other properties that can be 
utilized in characterization theorems jointly with additivity involve assumptions about 
the maximum likelihood estimators of the parameters. Puig [13] and Puig and Valero 
[14] characterize count families of distributions using several notions of additivity and 
assuming that the maximum likelihood estimator of the population mean is the sample 
mean. 
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In this paper we characterize count models using the additional notion of closure under 
binomial subsampling or under independent p-thinning. This property, studied in Wiuf 
and Stumpf [18], is properly defined in Section 2. The notion of binomial subsampling 
is quite natural in practice. Examples from mathematical biology can be found in Wiuf 
and Stumpf [18]. Roughly speaking, the property of closure under binomial subsampling 
means that if a count experiment is governed by a model with this property, the same 
family of distributions can be used to fit the data sets independently of the degree of loss 
of information due to the subsampling action. 

From a result of Wiuf and Stumpf [18] one can directly establish the structure of the 
probability generating function of any count statistical model, with finite moments of 
all orders, that is closed under binomial subsampling. Corollary 1 in Section 2 extends 
this result to the first r finite moments. Proposition 2 in Section 2 establishes that these 
models arc constituted by mixed Poisson distributions and distributions of independent 
p-thinnings of random variables that do not come from any particular variable by in- 
dependent p-thinning. Following Thcdccn [17], these random variables that cannot be 
generated from an independent p-thinning of any random variable will be called top 
inverses. In order to illustrate all these results, three examples are given in Section 2. 

The main result of this paper is stated in Theorem 1 in Section 3: all the r-parameter 
families of count distributions (satisfying mild conditions) that are closed under addition 
and under binomial subsampling arc found. These are the rth-order univariate Hcrmitc 
distributions of Milne and Westcott [11]. The case r = 1 corresponds to the Poisson 
model, and r = 2 is the ordinary Hcrmite family of distributions. 

2. Count data models closed under binomial 
subsampling 

We begin with a definition that can be found in Grandell [5], page 25. 

Definition 1. Let X be a count random variable, let £i,£2, ■ ■ ■ be independent and iden- 
tically distributed Bernoulli variables with probability of success p, for some p G (0,1], 
and let X be independent of the £j . The count random variable 

-Xp = £L+£2 + --- + £x (X p = 0ifX = 0) 

is called an independent p -thinning of X. 

The concept of p-thinning is widely used in point process theory, and it is also use- 
ful to describe count random variables coming from practical scenarios where a certain 
loss of information occurs. The random variable X p can be understood as a binomial 
subsampling of X. For instance, suppose that X is the random variable which counts 
the number of seeds falling in a plot. Assume that each seed has the same probability p 
of germinating and becoming a plant, and that for all seeds these actions are mutually 
independent and independent of the total number of seeds. Then the number of plants 
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in the plot is X p , a binomial subsampling of the number of seeds. The concept of p- 
thinning can be naturally generalized for multidimensional count random variables by a 
componentwise action. For instance, given a bivariate count random variable X = (Y, Z), 
an independent p-thinning (p = (r, s) £ (0, l] 2 ) can be defined as X p = (Y r , Z s ), where 
Y r and Z s arc independent r- and s-thinnings. It is interesting to remark that if p is 
the correlation coefficient of the two components of X and p p the correlation coefficient 
of the components of its p-thinning, then /? 2 < p 2 , and equality holds in the trivial case 
when r = 1 , s = 1 . Consequently, the p-thinning action diminishes the linear association 
of the components. 

Let &x(t) be the probability generating function (pgf) of a one-dimensional count 
random variable X. Recall that any pgf is a real analytic function, at least in t £ (— 1, 1). 
Moreover, if X has finite moments of all orders, then 3>x{t) is C°° at t = 1 and <j>^'(l) = 
'E(X(X — 1) • • • (X — k + 1)) = fi(k) ' that is, its fcth factorial moment. When X has finite 
moments of all orders, the fcth derivative of log($x(*)) at t = 1 is known as the fcth 
factorial cumulant of X, namely K(fe), and they are also quantities of interest for our 
purposes. 

The pgf of an independent p-thinning or binomial subsampling of X is (Grandell [5], 
page 25): 

*x,(t) = *x(l-p(l-t))- (1) 

The following lemma shows some elementary properties of an independent p-thinning 
that will be useful later on. The lemma follows immediately from (1). 



Lemma 1. Let X be a count random variable with finite moments of all orders, and 
let and Kha be its kth factorial moment and cumulant, respectively. Its population 
mean p^ = K{\) will be denoted by p. Let p* k ^ or K* k ^ denote the kth factorial moment 
or cumulant for X p , a binomial subsampling of X . Then p* k ^ =p k p^ and =p k K{k)- 
Moreover, the quantities fi(k)/ ' and k^/^ are invariant under binomial subsampling, 
that is, /i(fc)//i fc = fi* k ^/(iJ*) k and K^//j, k = K^/(/j,*) k for any binomial subsampling. 

Using the notation r\i = and the well-known relationships between the 

factorial cumulants and the ordinary cumulants (for instance, see Appendix 7 of Douglas 
[1]) the expressions below follow immediately: 

a 2 — [i K3 — 3er 2 + 2u K4 — 6^3 + lltr 2 — 6p, . 

Vi = — > V2 = 3 , Tfy= 4 ) ( 2 ) 

p z p a pr 

where a 2 , K3 and K4 denote the variance and the third and fourth cumulants, respec- 
tively Using the same technique we could find other quantities which are invariant under 
binomial subsampling involving higher-order cumulants. However, the cumulants of order 
more than 4 are not very useful from the statistical point of view. 

Consider now a count statistical model, that is, a set of probability functions T of 
count random variables indexed by the parameter 9 £ 6 C R k . Suppose that the model 
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satisfies some minimum regularity conditions with respect to the parameters, that is, the 
model can be identified by a continuous function from (0, 1) x 9 C R fc+1 into R, namely 
$(i; 9), where, for each fixed value of 8o E 9, 6q) is a pgf. Moreover, we also assume 
that the parameter domain 9 is the widest possible. That is, if $*(i; 9) is any continuous 
function from (0, 1) x 6* C R k+1 into R, 9 C 9*, such that 0) = 9) for all G 9, 
then for each 9 E 9* \ 9, <&*(t;6*o) is not a pgf. The following definition, given in Wiuf 
and Stumpf [18], will be essential for our purposes. 

Definition 2. Let J- be a count statistical model. It will be called closed under binomial 
subsampling or under independent p-thinning if, for any random variable X with dis- 
tribution belonging to the model, all its independent p -thinnings, for any p E (0,1], have 
distributions that also belong to the model. 

The Poisson distribution, the most frequently used statistical model to analyse count 
data, is closed under binomial subsampling. By way of explanation, for any Poisson ran- 
dom variable X, any independent p-thinning is also a Poisson random variable. However, 
the zero truncated Poisson distribution is not closed under independent p-thinning for 
any p E (0, 1] except for the trivial case p = 1. Following Thcdeen [17], a random variable 
Y that is not an independent p-thinning of any random variable is called a top inverse. 
It is obvious that Y is a top inverse if its distribution has at least one zero-gap, that is, 
if there exists < k < n such that P(Y = k) = while P(Y = n) > 0. 

Models closed under binomial subsampling are very significant in practice. If a count 
experiment is ruled by one of these models, it means that the same family of distributions 
can be used to fit the data sets independently of the degree of the loss of information. 
The following result describes their pgfs: 

Proposition 1. Let J- be a statistical model parameterized by the population mean p and 
described by the set of pgfs <!>(<; p). Then it is closed under binomial subsampling if and 
only if $(t;p) — g(p-(t — 1)), for a certain real analytic function g{x) and 9 = (0,mm]> 
\xm < oo or 9 = (0, oo ) . 

Proof. We first prove sufficiency. Let X be a random variable having a pgf of the form 
g(fJ,o(t — 1)), where E(X) = po, Mo E 9- Direct calculations from (1) show that the pgf 
of any p-thinning of X is precisely g(pfio(t — 1)). Consequently, p/xo E (0,/xm] and the 
model is closed under binomial subsampling. 

Turning to necessity, consider a fixed /xo E 9 (here 9 means the interior of 9), where 
Mo > because it represents the population mean of a count random variable. Any < 
M < Mo belongs to 9 because T is closed under p-thinning and fi is the expectation of an 
independent p-thinning (p = m/mo) of a random variable whose pgf is $(£; Mo)- In terms 
of their pgfs, by (1) this is equivalent to the identity 




(3) 



548 



P. Puig and J. Valero 



We define g^x) = <E>(1 + x/^ ;fi ), which is real analytic for |1 +x//iq\ < 1 or, equiva- 
lently, for — 2/i < x < 0. Then (3) can be rewritten as 

$(*;m) = 5/. 0(*-i)). 

and this is valid for any < (i < /io . 

Consider now a fixed \i\ eO, /Lti > /io- Repeating the preceding argument, defining 
g^ix) = $(1 + x/fii;fj,i), we conclude that $(i;/i) = — 1)), for any < /i < /Xi. 

Consequently g^ {^{t — 1)) = g tll {n(t — 1)), for < /i < /io- Since g^ (x) and <7 Ml (x) are 
real analytic and they overlap in the open set — 2/io < a; < 0, we obtain g fi0 (x) = g fil (x) = 
g(x) and therefore the function g(-) does not depend on the initial values of /io or fii. 
It follows directly that <&(i; /i) = </(/i(i — 1)), for any /i£9, and 8 must be of the form 
(0, iim) where /j,m can be oo. In order to conclude that © = (0, hm], it is enough to prove 
that if — 1)) is a pgf, for any < /i < (1m < oo, then g(fiM(t — 1)) is also a pgf. This 
is immediate because g(x) is real analytic in — 2/ijv/ < a; < 0. Then it can be expanded in 
a power series around —[Lm as 

k=0 

being convergent for —fJ,M < x < 0. Making x = /iA/(£ — 1), we obtain 



oo 



5 (MM(i-i)) = E^r^^' 

k=0 



which is real analytic for — 2 < t < 1. On the other hand, we know that <?(/i(i — 1)) = 
Efcloff (fc) (^A 1 )/ i ' c ^/ fc! is a pgf for any </x< /i M - Consequently, Y,T=o 5 W ( _ M)£«V fc! = 
1 and g^ (— /i) n k / k\ > for fc = 0, 1, 2, . . . , for any < /i < fiM- By continuity we con- 
clude that J2T=o g (k) (-^Vm/M = 1 and gW (-/i M )/4// fc! > for A; = 0, 1, 2, ... , which 
completes the proof. □ 

The result of this proposition can easily be extended to count models with more than 
one parameter, such that their vector of parameters has the form 9 = (£t,£i,£2, ■ • ■ ,£r), 
where any fj, i = 1, . . . ,r, is invariant under independent p-thinning. These could, for 
instance, be the quantities /i^)/// or K/ i \/fj,' > specified in Lemma 1 or the quantities in 
(2). This extension is specified in the following corollary: 

Corollary 1. Let J- be a statistical model parameterized by 6 =(//,£) 6 C M. r , where fi 
is the population mean and £ — (£2,63, • ■ ■ , £r) is invariant under binomial subsampling. 
Let (fr(t;9) be the set of pgfs that describes the model. Then it is closed under binomial 
subsampling if and only if = g(fJ,(t — for certain real analytic functions 

g(x,£,). Moreover, fixing £ = £o ; the domain of fi is of the form (0,/i^J, fi^ < 00, or 
(0,oo). 
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Remark. Proposition 1 can also be proved by using Theorems 2.4 and 2.6 of Wiuf and 
Stumpf [18]. Theorem 3.1 of these authors is equivalent to our Corollary 1 when it is 
assumed that the distributions of the model have finite moments of all orders and these 
moments determine the distribution. Notice that Corollary 1 is more general. 

In order to have a better understanding of the structure of the models which are closed 
under binomial subsampling, we state the following result that is an extension of Theorem 
2.3 of Grandell [5]. 

Proposition 2. Let J- be a statistical model, closed under binomial subsampling, param- 
eterized by 9 = G 6 C R r , where the parameter £ is also invariant under indepen- 
dent p-thinning. Consider a random variable X whose distribution belongs to T ; then it 
satisfies one of the following two assertions: 

(i) X is mixed Poisson distributed. 

(ii) X is an independent p-thinning of a random variable, say Y , whose distribution 
belongs to T such that Y is a top inverse. 

Proof. By Corollary 1, fixing £ = £o, the domain of yU is of the form (0,oo) or (0,/i^J, 
fi£ < oo. The two assertions depend on whether the right extreme of the domain of \x 
is included or not. Let X be a random variable whose distribution belongs to J- and 
E(X) = W 

(i) Consider that the domain of fj, is of the form (0,oo). Then X can be obtained 
by independent p-thinnings of random variables whose distributions belong to 
J-, having parameter 9' = (po/Pi£o) for any p £ (0,1]. Then, by Theorem 2.3 of 
Grandell [5], X is mixed Poisson distributed. 

(ii) If the domain of /z is of the form (0, /U< ], fio < /i£ , X is an independent p-thinning 
of Y whose distribution has parameter (At£ ,£o); for p = /xo//x^ . Notice that Y is 
not an independent p-thinning of any random variable in T except itself. Moreover, 
this is not an independent p-thinning of any random variable because, as the 
domain is the widest possible, such a random variable would belong to T . 

□ 

What distributions characterize the class of top inverse random variables? The exis- 
tence of a zero gap is a sufficient but not a necessary condition for a random variable 
Y to be a top inverse. However, if the distribution of Y has compact support this con- 
dition is also necessary (Corollary 3.1 in Thcdeen [17]). Example 3 below shows how to 
construct top inverses without zero-gaps. As a referee has pointed out, further research 
on characterizations of the class of top inverses would be very interesting. Yannaros [19] 
has some results related to inverses of thinned renewal processes. 

We conclude this section with some examples of models closed under binomial sub- 
sampling and show how Propositions 1 and 2 and Corollary 1 can be used. 
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Example 1. This example constitutes an interesting application of Corollary 1. Consider 
the Hermite distribution introduced by Kemp and Kemp [7, 8] . A convenient parametriza- 
tion is by means of its population mean and variance. Its pgf has the form 



The domain of parameters is = {(/j, a 2 ) : p > 0, \x < a 2 < 2/i}. Changing the parameters 
to (yu, ?7i ) by using (2), we obtain 



where the domain of parameters is now 0' = {(/z, 771 ) : r/i > 0,0 < p < 1/771}. Notice that 
/i, 771) = g(n(t — 1), 771), where g(x; rji) = exp(r)ix 2 /2 + x). Consequently the Hermite 
family is closed under binomial subsampling. It is known that the Hermite distribution is 
not a mixed Poisson model (Kemp and Kemp [7] ) . It can easily be shown that any Hermite 
random variable with fixed /Jo and rji can be obtained by independent p-thinning from 
another random variable with pgf 



with p = pi]\. This pgf corresponds to a random variable of the form 2X, where X is 
Poisson distributed, and this is a top inverse random variable because P(2X = k) = for 
k = 1, 3, 5, .... It is in concordance with Proposition 2. 

Example 2. Consider now the negative binomial family of distributions parameterized 
by its population mean and variance. Its pgf has the form 



The domain of parameters is = {(/i, a 2 ) : /1 > 0, /u < cr 2 }. Changing the parameters to 
(/x, 771) as in Example 1, we obtain 



where the domain of parameters is now 0' = > 0,771 > 0}. Notice that 

$(/;; /1, 771) = g{[i{t — 1), 771), where g(x; 771) = (1 — 771 x)^ 1 ^ 1 . Corollary 1 again shows that 
the negative binomial family of distributions is closed under binomial subsampling. 

This family is a mixed Poisson model. Notice that any negative binomial random 
variable with parameters (/^Oj^io) is an independent jj-thinning of a negative binomial 
random variable with parameters {no/p, »7i ), for any p £ (0, 1]. 







$(t;/i,77i) = (l-A»/i (t-1)) 
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Example 3. Consider a count random variable X such that pk = P(X = k) > for k = 
0, 1, 2, . . . and P2k+i/p2k (or P2k/p2k-i) tends to infinity when k — > oo. Our aim is to prove 
that X is a top inverse. Suppose that there exist a random variable Y such that X can be 
obtained as an independent p-thinning of Y; we will arrive at a contradiction. According 
to (1), the pgf of Y is <&y(t) = $>x (1 + (t — l)/p)- Consequently, the probabilities p* k = 



» re ^ V fc / V P 



P(Y = fc), fc = 0,1,2,..., satisfy 

ri = t!4"(0) = ^(i-i)=^g 

Adding each pair of consecutive terms, we arrive at the expression 

* ft , U-V/' 2l + fc ^ 2t (, 2i + k + 1 P2i+k+l\ 

Pfc = (1 + <0 §1 * J" ^l 1 " 6 * + l P2t+fc J' 

where £ = — 1 + 1/p. Therefore, for any e > 0, there exists a r such that, for all i > 0, 
P2i+r+i/p2i+r > 1/e. Consequently p* < 0, which is a contradiction. 

Now many examples of top inverses without zero gaps can be construct ed by alternating 
two series of positive terms. For instance, this is the case for the top inverse random vari- 
able X defined by the probability function p 2 fc = c(l/3) fc , P2k+i = c (l/2) fc , k = 0, 1,2, . . ., 
where c = 2/7 is the normalizing constant. Its pgf is $x(*) = 6/(21 - It 2 ) + 4t/(14 - It 2 ). 
According to (1), the statistical model constructed by binomial subsampling of X is de- 
scribed by the pgfs = 6/(21 - 7(1 -p(l - 1)) 2 ) + 4(1 -p(l - t))/{U - 7(1 -p(l - 
£)) 2 ), where p e (0, 1]. It is immediately evident that the population mean is p = 15p/7 
and, consequently, the model parametrized by p is described by the pgfs <&(i;/n) = 
g(p(t-l)), where g(x) = (900 + 420x)/(3150 - 7(15 + 7x) 2 ) + 1350/(4725 - 7(15 + 7x) 2 ). 
Proposition 1 shows that this model is closed under binomial subsampling. Notice that 
the domain of parameters is n S (0, 15/7] and cannot be extended. This is again in con- 
cordance with Proposition 2. 



3. Characterization 

In many applications it is reasonable to use count models which have the property of 
conservation under addition of independent effects, that is, their distributions are closed 
under convolutions. 

Definition 3. Let J- be a statistical model. It is said to be closed under addition if, 
for any two independent random variables X , Y with distributions belonging to T , the 
distribution of X + Y" also belongs to T . 

For instance, this property is satisfied by the Poisson model or by the Hcrmitc family 
of distributions (see Puig [13]). A quite natural parameterization of a model closed under 
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addition is by means of its cumulants. This is due to the fact that the ith cumulant of 
the sum of two independent random variables is the sum of the corresponding ith cumu- 
lants. It also occurs with factorial cumulants, which therefore provide another reasonable 
parameterization of a model closed under addition which is convenient for our purposes. 
The following proposition can be proved immediately using a methodology similar to 
that of Teicher [16] concerning the structure of the characteristic function of a model 
closed under addition. 



Proposition 3. Let T he a count statistical model parameterized by its r first factorial 
cumulants 9 = (/t(i) , K (2)i • ■ ■ i K (r))> with a pgf continuous in 9 £ O C M. r . The model is 
closed under addition if and only if its pgf can be expressed as 

3(t;0)=eaq>j^ai(t)K (i) j, (4) 

where a% (t) are functions not depending on the parameters. 

Proof. It can immediately be seen that the form of the pgf (4) implies that the model 
is closed under addition. 

Suppose now that the model is closed under addition. This means that, for any pair 
of values 0±, 92 GO, we have 

$(i;0i + 2 ) = $(i;0i)$(i;0 2 ). 
Taking logarithms and fixing £, we obtain 

f(0i + e 2 ) = f(e 1 ) + f(e 2 ), (5) 

where f{9) = log(<I>(i; 9)). But (5) is the multidimensional Cauchy functional equa- 
tion. It is known that its solution has the form, f(6) = Y]a—i ctj6j, where the on do 
not depend on 9. However, they can depend on t, and this leads to the solution 
log($(i; 9)) = J2i=i a i{t)9i, thus concluding the proof. □ 

Proposition 1 and Corollary 1, together with Proposition 3, allow us to find all the count 
statistical models satisfying mild conditions, such that they are closed under addition and 
binomial subsampling. Surprisingly there do not exist many types of models satisfying 
both properties. 

Theorem 1. Let J- be a count statistical model parameterized by its r first factorial 
cumulants 9 = (rem, /t( 2 ), ■ ■ ■ , ft(r))- Assume that the model has a pgf continuous in 9 & 
6cl r . The model is closed under addition and under binomial subsampling if and only 
if it has a pgf of the form 



$(t;0)=exp j£M(t_l)*L. ( 6 ) 



, i=l 
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Proof. We first prove sufficiency. The form of the pgf (6) implies, by Proposition 3, that 
the model is closed under addition. Notice that this pgf can also be reparameterized as 



where £ = (£2, ■■•,&■) an d & = Ku)/^ 1 . Then the pgf can be written as = 
g(n(t— 1);$), where g(x;£) = exp(x + J2l=2 Ci^AO- Using Corollary 1, we conclude that 
the model is closed under binomial subsampling. 

Turning to necessity, suppose that the model is closed under addition and under bino- 
mial subsampling. Again using the parameterization where & = Kuy/fx 1 , by Propo- 
sition 3 and Corollary 1 we obtain the identity 



that is, a polynomial of degree r in x. Since it must not depend on the choice of to, we 
obtain that the functions oii(t) in (4) have the form ai(t) = c%(t — l) 1 , for i = 1, . . . ,r, 
where the Cj are constants. But these constants are Cj = because the Taylor series 
coefficients of log($(£; 0)) at t = 1 must be exactly Ku\/i\, i = l,...,r. This concludes the 



The preceding theorem shows that, under mild conditions, the only statistical model 
closed under addition and under binomial subsampling, such that it can be parameterized 
by its population mean (r = 1), is the Poisson model. 

Kemp and Kemp [8] present an elementary proof showing that the only count dis- 
tribution such that its pgf has the form (6), having a second-degree polynomial in the 
exponent, is the Hcrmitc distribution. Consequently, the two-parameter case (r = 2) de- 
scribed by Theorem 1 corresponds to the Hermite family of distributions. 

Notice that the polynomial of degree r that appears in the exponent of (6) can always 
be expressed as X)I=i a i(^ ~ !)■ This leads to an alternative parameterization that allows 
us to represent the pgf of the count models characterized by Theorem 1 in the form 





Fixing £ = £0 an d t = to and taking x = fx(to — 1), we obtain 




proof. 



□ 





This representation is interesting because, if the a.^ are non-negative, any count random 
variable with a pgf as in (7) can be understood as a linear combination of independent 
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Poisson random variables (Feller [2], pages 291-292). Specifically, it can easily be shown 
that any count random variable X = Xa=i iXi> where Xi are independent Poissons with 
mean a,, has a pgf of the form (7). However, it is possible for Q(t;a) to be a pgf even 
when some of the at in (7) are negative, but in this case these pgfs are not infinitely 
divisible (see Levy [9]; Lukacs [10], page 251; Milne and Westcott [11]). 

Milne and Westcott [11] designate these distributions as generalized Hermite or rth- 
order univariate Hermite. We prefer the second name because of Gupta and Jain's [6] use 
of the name 'generalized Hermite distribution', related to the specific case in (7) where 
only a\ and a r are non-zero. 

3.1. The rth-order univariate Hermite distribution 

Let X be a count random variable with a pgf as in (7). Consider its probabilities, pk = 
P(X = k), k = 0, 1,.... Notice that $(t; a) = J2T=oPk tk = cx p{ELi a i(^ ~ !)}• Then ; 
taking logarithms and differentiating with respect to t, we obtain the identity 

r 

Pi + 2p 2 t + 3 P3 t 2 +--- = (p +pxt + P2 t 2 +■••)£ *a»* i-1 - 

j=i 

Equating the terms with the same degree in t in both parts of the identity, we obtain 
the following lemma: 

Lemma 2. Let X be a count random variable with a pgf as in (7). Then its probabilities 
can be calculated from the recurrence relation p^ = Q2 i=i ipk-iOi) / 'k , k = 1, 2, . . . , with 

PO = exp{- J2i=l a i) and P-l =P-2 = ■■■ =Pl-r = 0. 

For the case r = 2 (Hermite distribution) , this particular recurrence relation was found 
by Kemp and Kemp [7]. Notice that Lemma 2 is a version of the Panjer recursion formula 
(Panjer [12]). 

The parameterization in terms of the a; is particularly worthwhile in order to calculate 
the probabilities. Moreover, it can be easily shown that the cumulants can be directly 
calculated from the ai by means of the expression, k s = i s Qi- When all the a, are 
non-zero, it is known (see Remark 1 in Milne and Westcott [11]) that if at least oi, 
a r _i and a r are non-zero, then a\ > 0, a r _i > and a r > are necessary conditions for 
(7) to be a pgf. Consequently, the full domain of parameters for r = 2 and r = 3 does 
not include negative values for any <Zi but, for r > 4, some of these coefficients could be 
negative. The domain of the parameters that will be considered subsequently for practical 
applications is = {a = (ax, ■ ■ ■ ,a r ) :ai > 0}. This ensures that the distributions are 
infinitely divisible. Moreover, when all the a; are non-negative, it can easily be checked 
that the distribution is overdisperscd, that is, the variance of the count variable exceeds 
its mean. Consequently, these models can be used in practice to analyse data sets when 
we encounter the phenomenon of overdispersion. 

In order to choose the appropriate order r of the model, a hierarchical approach based 
on the likelihood ratio test can be used. In our context this means testing the hypothesis 
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Ho:a r = 0. It is important to take into account that, under the null hypothesis, the 
likelihood ratio test statistic does not have an asymptotic \\ distribution as usually 
happens, because a r = belongs to the boundary of the domain of parameters. It can be 
established that in this situation the asymptotic distribution of the likelihood ratio test 
statistic is a 50:50 mixture of the constant zero and the \\ distribution (Self and Liang 
[15]; Geyer [3]). The a upper tail percentage points for this mixture are the same as the 
2a upper tail percentage points for a Xi distribution. 
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